Q Learning based Reinforcement Learning Approach to Bipedal Walking Control
نویسندگان
چکیده
Reinforcement learning has been active research area not only in machine learning but also in control engineering, operation research and robotics in recent years. It is a model free learning control method that can solve Markov decision problems. Q-learning is an incremental dynamic programming procedure that determines the optimal policy in a step-by-step manner. It is an online procedure for learning the optimal policy through experience gained solely on the basis of samples. A Q learning based reinforcement learning of a double inverted pendulum has been shown in this paper which reaches a limit cycle at the end of several learning cycles. The double inverted pendulum becomes stable, since the pole angle and pole angular velocity become zero. Stabilization of an equivalent double inverted pendulum representing a bipedal robot has been successfully implemented for balancing the pole angles in the required range using Q learning in Reinforcement Learning. Keywords—Q learning; Double inverted pendulum; Limit Cycle.
منابع مشابه
Dynamic bipedal walking assisted by learning
This thesis proposes a general control architecture for 3D dynamic walking. It is based on a divide-and-conquer approach that is assisted by learning. No dynamic models are required for the implementation. In the approach, the walking task is divided into three subtasks: 1) to maintain body height; 2) to maintain body posture; and 3) to maintain gait stability. The first two subtasks can be ach...
متن کاملFeedback Control For Cassie With Deep Reinforcement Learning
Bipedal locomotion skills are challenging to develop. Control strategies often use local linearization of the dynamics in conjunction with reduced-order abstractions to yield tractable solutions. In these model-based control strategies, the controller is often not fully aware of many details, including torque limits, joint limits, and other non-linearities that are necessarily excluded from the...
متن کاملReinforcement learning based feedback control of tumor growth by limiting maximum chemo-drug dose using fuzzy logic
In this paper, a model-free reinforcement learning-based controller is designed to extract a treatment protocol because the design of a model-based controller is complex due to the highly nonlinear dynamics of cancer. The Q-learning algorithm is used to develop an optimal controller for cancer chemotherapy drug dosing. In the Q-learning algorithm, each entry of the Q-table is updated using data...
متن کاملA dynamically-Balanced Walking Biped
Describes the mechanical, electronic and software design of a 10-DOF bipedal robot which has been constructed to study control, parameterisation and automatic expansion of the stability envelope of a complex real-time behaviour, namely, dynamically-balanced two-legged walking. The machine is physically complete and demonstrates reasonable reliability in movement control including dynamically-ba...
متن کاملDOCTORAL THESIS PROPOSAL Biped Locomotion: Augmenting an Intuitive Control Algorithm with Learning
Foot placement is a key determinant for the stabilization of walking speed and lateral motion of a biped. However, there is no closed form expression for the foot placement parameters in term of the walking speed or other gait parameters. A simple and intuitive control algorithm (called “Turkey Walking”) based on Virtual Model Control (VMC) was successfully applied to planar bipedal walking. Ho...
متن کامل